# Large Model Inference

MedGemma 27B Text IT 4bit

MedGemma-27B-Text-IT-4bit is an MLX-format model converted from Google's MedGemma-27B-Text-IT model, specifically optimized for medical and clinical reasoning tasks.

Large Language Model

Parakeet TDT 0.6B V2 ONNX

NVIDIA Parakeet TDT 0.6B V2 is an automatic speech recognition (ASR) model for English speech-to-text.

Speech Recognition, English

rank1-32b is an information retrieval reranking model based on Qwen2.5-32B. It judges relevance by generating a reasoning chain before deciding.

Large Language Model

Transformers, English

Meta Llama 3.3 70B Instruct AWQ INT4

Llama 3.3 70B Instruct AWQ INT4 is the 4-bit quantized version of the Meta Llama 3.3 70B Instruct model, optimized for multilingual dialogue use cases and text generation tasks.

Large Language Model

Transformers, Supports Multiple Languages

Llama 3 8B Instruct QServe G128

Llama 3 is an open-source large language model family from Meta. This entry is the 8B Instruct model in its QServe G128 variant.

Large Language Model

© 2025 AIbase